Automatic speech recognition using acoustic confidence conditioned language models

Authors

  • Richard C. Rose
  • Giuseppe Riccardi
Abstract

A modified decoding algorithm for automatic speech recognition (ASR) will be described which facilitates a closer coupling between the acoustic and language modeling components of a speech recognition system. This closer coupling is obtained by extracting word-level measures of acoustic confidence during decoding, and making coded representations of these confidence measures available to the ASR network during decoding. A simulation of this decoding strategy is implemented using a word lattice rescoring paradigm. A joint acoustic-language model will be described where linguistic context is augmented to include the encoded values of acoustic confidence. Finally, the performance of the word lattice based implementation of the decoding algorithm will be evaluated on a large vocabulary natural language understanding task.
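As a rough illustration of the idea in the abstract, the following Python sketch conditions a bigram language model on the previous word together with a quantized confidence code for that word, and uses it to rescore candidate paths. It is a minimal sketch under stated assumptions, not the authors' implementation: the two-threshold quantizer, add-alpha smoothing, the names `encode_confidence`, `ConfidenceConditionedBigram`, and `rescore`, and the use of an explicit list of paths in place of a real word lattice are all hypothetical choices made for illustration.

```python
import math
from collections import defaultdict


def encode_confidence(score, thresholds=(0.3, 0.7)):
    """Quantize a confidence score into a small integer code.

    Hypothetical quantizer: the code is the number of thresholds the
    score meets or exceeds (0, 1, or 2 with the default thresholds).
    """
    return sum(score >= t for t in thresholds)


class ConfidenceConditionedBigram:
    """Bigram model whose history is (previous word, previous confidence code)."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha  # add-alpha smoothing constant (assumption)
        self.counts = defaultdict(lambda: defaultdict(float))
        self.vocab = set()

    def train(self, sentences):
        # Each sentence is a list of (word, confidence) pairs.
        for sentence in sentences:
            history = ("<s>", None)
            for word, conf in sentence:
                self.counts[history][word] += 1.0
                self.vocab.add(word)
                history = (word, encode_confidence(conf))

    def logprob(self, word, history):
        # Smoothed log P(word | previous word, previous confidence code).
        context = self.counts.get(history, {})
        total = sum(context.values())
        vocab_size = len(self.vocab) or 1
        numerator = context.get(word, 0.0) + self.alpha
        return math.log(numerator / (total + self.alpha * vocab_size))


def rescore(paths, lm, lm_weight=1.0):
    """Return the highest-scoring path.

    Each path is a list of (word, acoustic_logprob, confidence) triples,
    standing in for one path through a word lattice.
    """
    def score(path):
        total = 0.0
        history = ("<s>", None)
        for word, acoustic_logprob, conf in path:
            total += acoustic_logprob + lm_weight * lm.logprob(word, history)
            history = (word, encode_confidence(conf))
        return total

    return max(paths, key=score)
```

In this sketch the same word sequence can receive different language model scores depending on how confidently each preceding word was recognized, which is one simple way to read "linguistic context augmented with encoded confidence values".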


Similar resources

Allophone-based acoustic modeling for Persian phoneme recognition

Phoneme recognition is one of the fundamental phases of automatic speech recognition. Coarticulation, which refers to the integration of sounds, is one of the important obstacles in phoneme recognition. In other words, each phone is influenced and changed by the characteristics of its neighboring phones, and coarticulation is responsible for most of these changes. The idea of modeling the effects o...

Confidence Measures in HMM/MLP Hybrid Speech Recognition for Turkish Language

A confidence measure is defined as the posterior probability of word correctness given the values of confidence indicators [8]. Confidence measures can be calculated from the a posteriori probability of a recognized word or sub-word unit, inferred from acoustic and language models, including various normalization techniques. In this work we present several confidence measures and propose t...

Comparison of effects of acoustic and language knowledge on spontaneous speech perception/recognition between human and automatic speech recognizer

An automatic speech recognizer uses acoustic knowledge and linguistic knowledge. In large vocabulary speech recognition, acoustic knowledge is modeled by hidden Markov models (HMM), linguistic knowledge is modeled by N-gram (typically bi-gram or trigram), and these models are stochastically integrated. It is thought that humans also integrate acoustic and linguistic knowledge of speech when per...

Croatian Large Vocabulary Automatic Speech Recognition

This paper presents procedures used for development of a Croatian large vocabulary automatic speech recognition system (LVASR). The proposed acoustic model is based on context-dependent triphone hidden Markov models and Croatian phonetic rules. Different acoustic and language models, developed using a large collection of Croatian speech, are discussed and compared. The paper proposes the best f...

ASR for Automatic Directory Assistance: the SMADA Project

In this paper we summarise the state-of-the-art for automatic speech recognition in automated Directory Assistance at the start of the 5th Framework project SMADA. Details are given about robust acoustic features for use in Distributed Speech Recognition, especially with respect to noise suppression. Then an overview is given of the confidence measures which are in use today, and their similari...

Publication date: 1999